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TRAVELING WAVES OF SELECTIVE SWEEPS 

By Rick Durrett-*^ and John Mayberry^ 

Duke University and University of the Pacific 

The goal of cancer genome sequencing projects is to determine the 
genetic alterations that cause common cancers. Many malignancies 
arise during the clonal expansion of a benign tumor which motivates 
the study of recurrent selective sweeps in an exponentially growing 
population. To better understand this process, Beerenwinkel et al. 
[PLoS Comput. Biol. 3 (2007) 2239-2246] consider a Wright-Fisher 
model in which cells from an exponentially growing population accu- 
mulate advantageous mutations. Simulations show a traveling wave 
in which the time of the first fc-fold mutant, Tk, is approximately 
linear in k and heuristics are used to obtain formulas for ETk- Here, 
we consider the analogous problem for the Moran model and prove 
that as the mutation rate — >■ 0, ~ log(l//i), where the Ck can 
be computed explicitly. In addition, we derive a limiting result on a 
log scale for the size of Xk (t) = the number of cells with k mutations 
at time t. 

1. Introduction. Recent studies have sought to identify the mutations 
that give rise to common cancers by sequencing protein-coding genes in 
common tumor types, including breast and colon cancer [23, 26], pancreatic 
cancer [13] and glioblastoma [19, 24]. The last study is part of a 100 million 
dollar pilot project of the NIH, which could lead to a 1.5 billion dollar 
effort. These studies have rediscovered genes known to play a role in cancer 
(e.g., APC, KRAS and TP53 in colon cancer), but they have also found 
that tumors contain a large number of mutations. Analysis of 13,023 genes 
in 11 breast and 11 colorectal cancers in Sjoblom et al. [23] revealed that 
individual tumors accumulate an average of ~90 mutated genes, but only a 
subset of these contribute to the development of cancer. 
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Follow-up work in Wood et al. [26] studied 18,191 distinct genes in the 
same 22 samples. Any gene that was mutated in a tumor but not normal 
tissue was analyzed in 24 additional tumors, and selected genes were further 
analyzed in 96 colorectal cancers. Statistical analysis suggested that most of 
the ~80 mutations in an individual tumor were harmless and that <15 were 
likely to be responsible for driving the initiation, progression or maintenance 
of the tumor. These two types of mutations are commonly referred to as 
"drivers" and "passengers." The latter provide no selective advantage to the 
growing cancer mass, but are retained by chance during repeated rounds of 
cell division and clonal expansion (exponential growth). 

The results of [23] and [26] are in contrast with the long-held belief that 
most cancers are the end result of a handful of mutations. Armitage and 
Doll [1] constructed log-log plots of cancer mortality versus age and found 
slopes of 5.18 and 4.97 for colon cancer in men and women, respectively. 
From this they predicted that the occurrence of colon cancer was the result 
of a six-stage process. In essence their argument is that the density function 
of the sum of six exponentials with rates /Xj is 

^ fii ■ ■ ■ fiet^ / 5\ for small t. 

This result yields the density of the well-known gamma distribution when 
all the /ij are equal, but only readers with well developed skills in calculus 
(or complex variables) will succeed in deriving this result for unequal /Xj. 

The work in [1] and the subsequent work of Knudson [14], who used 
statistics to argue that retinoblastoma was the end result of two mutations, 
gave rise to a large amount of work; see [15] and the books by Wodarz and 
Komarova [25] and Frank [10] for surveys. From this large body of work 
on multistage carcinogenesis, we will cite only two sources. Luebeck and 
Moolgavakar [17] used multistage models to fit the age-specific incidence of 
colorectal cancers in the SEER registry (which covers 10% of the US popu- 
lation) to conclude that a four-stage model gives the best fit. Calabrese et 
al. [5] used data for 1022 colorectal cancers to argue that "sporadic" cancers 
developed after six mutations, but that in the subgroup of individuals with 
strong familial predispositions, only five mutations were required. 

There is good reason to doubt some of the conclusions of [23] and [26]. 
First, the statistical methods of [23] have been criticized (see letters on 
pages 762-763 in the February 9, 2007 issue of Science). Furthermore, in 
[26], a follow-up study on 40 of the 119 highest scoring genes, chosen be- 
cause they were in pathways of biological interest, showed that 15 of the 
40 genes (37.5%) were not mutated in any of the 96 tumors, casting doubt 
on the claimed 10% false discovery rate. However, the more recent studies 
[13, 19, 24] using well-known and trusted statistical methods have found 
similar patterns: an average of 63 mutations in pancreatic cancers and 47 in 
glioblastoma. 



TRAVELING WAVES OF SELECTIVE SWEEPS 



3 



To better understand this process by which an exponentially growing cell 
mass accumulates driver and passenger mutations, and, in particular, to un- 
derstand the data in [23], Beerenwinkel et al. [3] considered a Wright -Fisher 
model with selection and mutation in an exponentially growing population. 
They assumed that there were 100 potential driver genes and asked for the 
waiting time until one cell has accumulated k mutations. Simulations (see 
their Figure 3) showed that a traveling wave developed, in which the time 
until the first /c-fold mutant was approximately linear in k, and the authors 
used heuristic arguments to obtain quantitative predictions for the first time 
that a cell with k mutations appears. 

Here we will consider this problem for the analogous Moran model and 
prove asymptotic results as the mutation rate — )• for the behavior of 
Xj{t) = the number of cells with j mutations at time t. A cell with j muta- 
tions will be referred to as a type-j individual. Our main result is Theorem 2, 
which allows for an exponentially growing population N^{t) of individuals. 
The process of fixation of advantageous mutations in a population of con- 
stant size has been the subject of much theoretical work (see, e.g.. Chapter 
6 of [7]), so it is natural to ask how the behavior changes in an exponen- 
tially growing population. A second difference from the standard theory of 
the fixation of a single mutation is that we consider a situation in which 
new mutations arise before older ones have gone to fixation, a process often 
referred to as "stochastic tunneling." The resulting "Hill-Robertson" inter- 
ference (see, e.g.. Section 8.3 in [7]) can be analyzed here because only the 
newest mutation is stochastic while the older mutations behave determinis- 
tically. This idea was used by Rouzine et al. in [21] (and later developed in 
more detail in [4, 20]) as a heuristic, but here it leads to rigorous results. 

The rest of the paper is organized as follows. In Section 1.1 we begin with 
a fixed population size of = individuals and state Theorem 1, which 
says that when time is scaled by L = log(l//u), the log sizes of Xj, divided 
by L, converge to a limit that is deterministic and piecewise linear and so 
the time the first type-j individual appears is 0(log(l//i)). Since we have 
assumed the population size is fJ.~°', this time scale agrees with (i) results in 
[27, 28], which show that the rate of adaptation (defined as the change in 
the mean fitness of the population) for a related fixed-population-size Moran 
model is O(logA^) and (ii) simulations in [6] which suggest that the speed 
of adaptation depends logarithmically on both the mutation rate and the 
population size. In Section 1.2 we return to the growing population scenario 
and state our main result. Theorem 2, which generalizes Theorem 1. Section 
2 contains some examples elucidating the nature of the limit in Theorem 2 
and illustrating the traveling-wave-like behavior of the limit. We state the 
main tools used to prove Theorem 2 in Section 3, and Sections 4 and 5 
contain the technical details. 
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1.1. Fixed population size: Main result. We begin by considering our 
Moran model in a fixed population of N individuals and return to our anal- 
ysis of the exponentially growing population in Section 1.2. We assume that: 

(i) initially, all individuals are of type 0; 

(ii) type-j individuals mutate to individuals of type j + 1 at rate fi; 

(iii) all individuals die at rate 1 and, upon death, are replaced by an 
individual of type j with probability 

{i + jyx^{t) 

Wt'it) 

where (1 + 7)-^ is the relative fitness of type-j individuals compared to type-0, 
and 

w'^{t) = ^ii + jyxt{t) 

is the "total fitness" of the population. We assume throughout that 7 > 
is fixed (i.e., mutations are advantageous). Approximations of the time the 
first type-A: individual appears have been carried out for the neutral case 
(7 = 0) in [9, 11, 12, 22] (and applied to regulatory sequence evolution in 
[8]). The case 7 < is of interest in studying Muller's ratchet [18], but since 
deleterious mutations behave very differently from advantageous mutations, 
we will not consider this case here. 

We will suppose throughout that N ^1/ ^, that is, N ix — > cxd. If N ^ — )• 0, 
then the I's arise and go almost to fixation before the first mutation to a 2 
occurs, so the times between fixations are independent exponentials. We will 
not here consider the borderline scenario, although we note that in the case 
when N fji ^ ci > and A^7 — >■ C2 > 0, the limiting behavior of the system has 
been well studied and we obtain a diffusion limit describing the evolution of 
type-j frequencies (see, e.g.. Sections 7.2 and 8.1 in [7]). Let Tq = 0, and for 
j > 1 define 

Tf =inf{t>0:Xj^(t)>l}, 

that is, is the time of the first appearance of a type-j individual. In order 
to study the birth times we will prove a limit theorem for the sizes of the 
X'^{t) on a log scale. Let log"*" x = maxjlogx, 0}, L = log(l//u) and define 

7j- = (l + 7)^'-l 

for all j ^"L.Yn what follows, we shall use [xj to denote the greatest integer 
less than or equal to x and let \x~\ = \x\+\. 
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Theorem 1. Suppose that Xq{0) = N with N = for some a> I 

and suppose that 7 G G{a), the set of generic parameter values defined in 
(1.2). Then, as fi^O, 

Y^{t) = \ log+(Xj(Lt/7)) yj{t) in probability 

uniformly on compact subsets of (0, t*), where t* = t*{a,^) > is defined in 
(1.1). The limit yj{t), which depends on the parameters 0,7, is deterministic 
and piecewise linear and will be described by (a) and (b) below. Furthermore, 
if we define 



for j >0, then 



tj =tj{a,-/) =mf{t:yj{t) = 1} 



7 + 1 

> tj in probability 



as /.i — )• for all j > 0. 

(a) Initial behavior. yj{0) = (a — j)^. The convergence only occurs on (0, 00) 
because we have Yj^ (0) = for all j > 1 by assumption, so a discontinuity 
is created at time 0. 

(b) Inductive step. Let sq = and suppose that at some time Sn > 0, the 
following conditions are satisfied: 

(i) rUn = max{j : yj{sn) = a} and A;„ = max{j : yj{sn) > 0} both exist 
and yj{sn) > for all m„ < j < kn, 

(ii) yj+i{sn) > Ujisn) — 1 for all < j < /c„ so that, in particular. 

Let /c* = kn if yk„isn) <l, kl = kn + l if ykS^n) = 1 and define 



5n 



{a - yj{sn)h/-fj-m„, rUn <j<k 



"'^ \ (1 -yfc-(Sn))7/7fc*-m„, j = kl. 

For all t < An = min{5„j : m„ < j < A;* }, we then have 



VjiSn + t) 



{yj{Sn) + t-fj^m„/l)^, j < kl, 

0, j > kl, 

and conditions (i) and (ii) are satisfied at time Sn+i = Sn + ^n- 

Our description of the limiting dynamical system can be understood as 
follows. If type m„ is the most fit of the dominant types in the population at 
time Sn, then the yj{sn + t), m„ < j < kn, grow linearly with slope 7j-m„/7 > 
0, while the yj{sn + 1), j < rUn, decrease linearly with slope ^j-m^h < 
until they hit zero and ym„isn + t) stays constant. These rates remain valid 
until either yj reaches level a for some m„ < j < kn and there is a change 
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in the most fit dominant type, or y^* reaches level 1 and individuals of type 
A;* + 1 are born. These two events correspond to A„ = 5nj and A„ = 6n,k* , 
respectively. The condition yj^i{sn) > yj{sn) — 1 guarantees that after birth, 
the growth of type-j individuals is driven by selection and not by mutations 
from type-(j — 1) individuals. If this condition failed, we would encounter 
a discontinuity in the limiting dynamics like the one at time 0. We have 
rescaled time by 7"^ since, in most cases of interest, 7 is small (e.g., 7 < 0.01) 
and when 7 is small, we have 7^/7 ~ j so that the limit process described 
above is almost independent of 7. 

Parts (a) and (b) together describe the limiting dynamical system for all 
times 

00 

(1.1) t<r = J]A„ 

n=l 

since by part (a), (i) and (ii) in (b) hold at time sq = 0, and it is easy to 
see from the form of yj{t) that if (i) and (ii) hold at time s„, then they also 
hold at time Sn+i = Sn + A„. Note that the form of the limit implies that 
at times t ^ {sn}n>Oi there is always a unique j such that yj{t) = a, that is, 
a unique dominant type. This observation will prove useful on a number of 
occasions. In Section 2 we shall see examples in which t* = 00, but in Section 
2.4 we will prove the following result. 

Lemma 1. For any 7 > 0, there exists a-y such that t* < 00 whenever 

Therefore, in general, our construction cannot be extended up to arbi- 
trarily large times. We prove this lemma by showing that if a is large, then 
an infinite number of types will be born before any existing type-j > 1 indi- 
vidual achieves dominance. However, since it is easy to see by construction 
that we have t* > tj for any j, this is the only way that blow-up can occur. 
Hence, the limit process will accumulate an infinite number of mutations 
before time t* , which means our approximation is valid on any time interval 
of practical importance. 

The generic set of parameter values is 

(1.2) G(a) = {7 € (0, 00) : 5„j / ,5„,i for alH / j, n > 0}. 

In other words, when 7 € G{a), there is always a unique jn such that A„ = 
Sn,jn- For given value of a, G{a) is clearly countable, so our result is 
good enough for applications. If 5n,i = Snj for some iy^j, then either (i) 
type i's and type j's reach level a at the same time or (ii) type i's reach 
level a at the same time that type j's reach level 1. It is tempting to argue 
that since generic parameters are dense, the result for general parameters 
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follows, but proving this is made difficult by the fact that the growth rates 
are not continuous functions of the parameters since they depend on m„ = 
max{j:yj{sn) = a}. 

Theorem 1 is very general, but not very transparent. In Section 2 we will 
give some examples in which more explicit expressions for the birth times 
tj are available. Figure 1 shows examples in the first three "regimes" of 
behavior that we will consider. In the jth regime, type m + j arises (but not 
type m+ J + 1) before type m "fixates," that is, is of order A^= f^""]. These 
regimes closely correspond to the different scenarios considered in [4], in 
which the "stochastic edge," that is, the class of the most fit mutant present 
at positive quantities, is always assumed to be q fitness classes ahead of the 
population bulk, q is referred to as the "lead." In the notation of Theorem 
1, the lead is always /c* on the interval and in regime j, the lead 

is always j. In all three examples in Figure 1, we see the traveling- wave-like 
behavior observed in the simulations of Beerenwinkel et al. [3] (see also [21]). 
The time between successive waves is constant in the example from regime 
1, while in the examples from regimes 2 and 3, the time between successive 
waves is not constant, but converges to a constant as the number of waves 
goes to infinity. 

1.2. Growing population. We now consider a growing population of in- 
dividuals N^{t), t > 0, with a random initial population in N = {1,2, . . .} 
distributed according to some measure fo- At time 0, all individuals are of 
type and we suppose that, in addition to the previously imposed Moran 
dynamics, at rate pN^^{t), p>0, new individuals are added and their type 
is chosen to be k with probability 



As in the case of fixed population size, we are able to derive a limiting, 
piecewise linear approximation to 



To determine the correct growth rates, suppose that there are xj individuals 
of type j and that the population size is A^. We then have the jump rates 



(l + 7)^Ant) 



Y^{t)^{l/L)\og^X^,{Lth). 



Xj Xj + 1 rate: [(1 -|- p)N — Xj] 



E.>o(l + 7)'^i 



p,Xj. 
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If mutations can be ignored, then the growth rate of type j's is 
E.>o[(l +/')(! + 7?- (1+ 7)i^^^.- 



E.>o(l + 7)'^i 



[(l+p)(l + 7r"^-lK- 



if Xi = o{N) for i ^ m (recall that in the limiting dynamical system from 
Theorem 1, there is a unique dominant type at time t for all but a countable 
number of times) . This yields the expression 

A,_„ = (l + p)(l+7F-"^-l 

for the limiting growth rate of type j's in a population dominated by type 
m. 

If type-j individuals have size (1/^)^ at time 0, are growing at rate ^e(j) 
for some £{j) > 1 and the initial total population size is (1/^)^, then type 
j's will achieve fixation at the approximate time t satisfying 

{l/fife^'U)t = (l/nyeP^ or t= / "^ log(l//»)- 

This leads to the following result. Theorem 1 corresponds to the special case 
p = 0. 

Theorem 2. Let Fi'it) = {l/L)logN^'{tL/-f) and suppose that F''{0) ^ 
a in probability for some a > 0. Then, for all 7 € G{a, p), the set of generic 
parameter values given in (1-3), as fj, ^ we have a + tp/^ and 

Y-^{t) yj{t) in probability uniformly on compact subsets of [0,oo) and 
{0,t*), respectively, where t* = t*{a,p,^) is given in (1.4)- The limits yj{t), 
which depend on the parameters {a,p,^), are deterministic and piecewise 
linear and described by (a) and (b) below. Furthermore, if we define 

tj = tj{a,p,j) =mf{t:yj{t) = 1} 

for j > 0, then 

— ; > tj in probability 

L/7 

as ^ — ?• for all j > 0. 

(a) Initial behavior. yj{0) = (a — j)^ ■ 

(b) Inductive step. Let sq = and suppose that at some time Sn>0 the 
following conditions are satisfied: 

(i) TUn = max{j = a + psn} and kn = max{j ■.yj{sn) > 0} both 
exist and yj{sn) > for all rUn < j < k^, 

(ii) yj^i{sn) > yj{sn) — 1 for all < j < kn so that, in particular, 
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Let an = a + />s„, A;* = /c„ if yfc„(s„) < 1, A;* = /c„ + 1 if yk„{sn) = 1 and 
define 

r J — yj{Sn 

"'^ 1 (1 - yfc- (sn))7/Afc*-™„, j = k*. 

For all t < A„ = min{(5„j- : m„ < j < /c* }, we then have 

y.(s If) = / +%-m„/7)"^, i<fcn> 

" ^ lo, i>fe;, 

and conditions (i) and (ii) are ScitisfiGd. Sit time Sn+i — 

+ A 

n ■ 

The generic set of parameter values is 
(1.3) G{a, p) = {^e (0, oo) : 6n,j + 6n,^ for all i^j,n>0} 

and, of course, 



(1.4) **^E^ 



1 



The argument which we use to prove Lemma 1 implies that t* < oo whenever 
p > since a„ — )■ oo as n ^ oo. 

An example is given in Figure 2. Since the population size is growing, 
we progress through the different "regimes" of behavior defined earlier for 
the fixed population size, and the time between successive waves of sweeps 
decreases. This behavior can also be seen in Figure 3 of [3]. Here we are 
dealing with the small mutation limit so that our waves have sharp peaks. 

Motivated by the statistical analysis of cancer data in [23], Beerenwinkel 
et al. [3] were interested in the time T^^ at which a cell first accumulates 20 
mutations. Their choice of the number 20 was inspired by data from [23]. 
Using heuristics they obtained the approximation 

^ '^-^log(iV(o)iv(^2^o)) 

for j < 20. Their model evolves in discrete time, but the heuristics only 
use the fact that the drift in the Wright-Fisher diffusion limit (ignoring 
mutations) is given by 

bj{x)^-fXj{j - {j)), 

where (j) = YlJ^j (^^^ [''']' ps-g^ 253). To get the same drift in continu- 
ous time, we need to rescale time by 2/A^, as opposed to 1/A^, and hence 
we should replace 7 by 7/2 and by to obtain the analogous approx- 
imations for the Moran model. The important point to note is that the 
approximation in (1.5) is linear in j and hence yields constant estimates for 
the increments Tj* — T^_i, whereas we can see that in the limiting dynam- 
ical system, the increments are not constant, but decrease in length as the 
population size increases. Figure 3 shows a plot of ij-i, the constants from 
Theorem 2, as a function of j and illustrates the nonlinearity in j. 
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Fig. 2. Plot of the Umitmg dynamical system m Theorem 2 (growing population size). 
Parameters: p = 0.0013, 7 = 0.01 and a = 1.2. 



2. Examples. In this section, we discuss some examples with constant 
population size (p = 0) in which the explicit computation of the times tj is 
possible. To do so, it is more convenient to study the increments 




Fig. 3. Plot oftj-i, the constants from Theorem 2, as a function of j . Same parameter 
values as in Figure 2. 
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Theorem 1 then imphes that 



T 



L/7 



where (3j = — tj_2 for ah j > 1 if we use the convention that t_i = 0. 
We begin with the first regime of behavior where the lead is always 1 and 
we have /3j = /3 for all j > 2. In Section 2.2 we move on to discuss regime 
2 where the lead is always 2. In this case, we will see that /3j depends on 
j, but we have /3j — ?> /3 as j — )• oo so that asymptotically, the times between 
successive waves is constant. Section 2.3 contains some conjectures on the 
parameter range for the regime j > 3. In all these scenarios, we conjecture 
that t* = oo and our limiting result holds for all time. In Section 2.4, we 
will prove Lemma 1, showing that for any 7 > 0, there exists such that 
t* < 00 whenever a> a-y. 

2.1. Results for regime 1. Let r2 = 1 +7/72- The first regime occurs 
for 1 < a < r2. If 7 is small, then 7/72 ~ 1/2 and the condition is roughly 
a e (1,3/2). If 7 > 0, then 7/72 = 1/(2 + 7) < 1/2, so a < 2 throughout 
regime 1. 

Table 1 summarizes the situation. To explain the entries, we note that 
applying part (a) of the limit description implies that yi(0) = a — 1, and 
part (b) then implies that 

yi{s) = {a-l) + s 
for s < Aq = 2 — a. Since we have assumed that a < r2, we have 

Ai = 5i 1 A 5i 2 = (a - 1) A — = a - 1, 

7 

and applying part (b) tells us that we have y2(si = ^72/7 for all t < ^i- 
Another application of (b) then yields A2 = (^2, 2 = ^ — y2{s2)-, which gives 
the additional amount of time needed for ?/2 to hit 1. Since the relative sizes 

Table 1 

Sizes in regime 1. Times are given in units of L/^, entries are the size 
given as a power 0/ and indicates when the first of the type is born. 
The first row comes from (a), the next four from applications of (b) 



Time 


Time increment 


Type 


Type 1 


Type 2 


Type 3 


0+ 




Q 


a - 1 






Sl 


Ao = 2 - a 


a 


1 







S2 


Ai = a - 1 


a 


a 


72(« - l)/7 




S3 


A2 = 1 — A172/7 




Q 


1 





S4 


A3 = a - 1 




Q 


a 


72(a- 1)/' 
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of I's, 2's and 3's at time ,33 are the same as the relative sizes of O's, I's and 
2's at time si, we obtain the following result giving the limiting coefficients 
ofr;. 

Corollary 1. Suppose that N = for some a E {l,r2). Then, as 



(2 — a), and for all j > 2, > 13 in probability, 



Lh ' " ' ^ - ' L/7 

where /3 = Ai + A2 = (a - 1) + 1 - (a - 1) ^ = (2 + 7) - (1 + 7)0. 

Figure 1 illustrates the limiting dynamical system in the case where 7 = 
0.01 and a = 1.3. We can see that in regime 1, the system is characterized by 
a "traveling wave of selective sweeps" in type space, that is, the growth and 
decay of types j > 2 occur translated in time by a fixed amount. In Figure 4 
we show the distributions of types at the times when type-5, -9, -13 and -17 
individuals are born (the times t^, t^, ti2 and tie from Theorem 1). As we 
move from time tj to tj^4, the distribution is shifted by a constant amount. 

2.2. Results for regime 2. Regime 2 occurs for r2 < a < r^ with = r2 -|- 
7/73. When 7 is small, 7/73 ~ 1/3, so this regime is roughly a € (3/2, 11/6). 
In general, < 11/6, so we have a <2 throughout this regime. As in the 
previous section, it is easiest to explain the conclusions of Theorem 1 with 
a table; see Table 2. 

Since a <2 the first two rows are the same as in regime 1, and we again 
have 

rf- (2-a)L/7. 
However, we now have a > r2 so that 

Ai = di^i A (5i,2 = (a - 1) A (7/72) = 7/72, 
and hence the 2's reach level l//i before the I's fixate. This yields 

rr ~ — • — . 

72 7 

Now 2/1(52) = 1 -|- 7/72 = ^^2) so the additional time it takes yi to reach level a 
is 62,1 = a — r2. Since a < r^, we have (a — r2)^zh < 1 ^^"^ hence ^2,1 < (52,3, 
that is, the I's will fixate before the 3's reach level l//i. To show that the 
I's fixate before the 2's and conclude that A2 = 52,i = ot — r2, we need to 
show that (a — r2) < (a — 1)7/72, which holds if and only if 

(2.1) a<l±l. 

^ ' 1 + 7 
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2 4 6 8 10 12 14 16 18 
Type of Individual (j) 




2 4 6 8 10 12 14 16 18 
Type of Individual (j) 



2r 




Type of Individual (j) 



Fig. 4. Distribution of types at the times ti, ts, ti2 and ti^ (from left to right) with the 
same parameters as Figure 1 (top to bottom). 
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Table 2 

sizes in regime 2, time in units 0/L/7 



Time Time increment Type 1 Type 2 Type 3 



0+ 






Q - 1 






Sl 






1 







S2 


Ai 


= 7/72 


r2 


1 





S3 


A2 = 


- a — r2 


a 


1 + A272/7 


A273/7 


S4 


A3 = ^ 

72 


(1-A2^) 




1 + A272/7 + A3 


1 



However, comparing r^ = l + 7/72 +7/73 with the upper bound in (2.1), we 
can see that 

1^ / ^ / ^2 + ^ ^ (3 + 37+V) + (2 + 7) 1 
1 + 7/72 + 7/73 < — ^77- — , o — ^ — 2^ < 



1 + 7 (2 + 7)(3 + 37 + 72) I + 7 

5 + 97 + 57^ + 7^ ^ 

6 + 97 + 572+73 

The last inequality is always true and therefore (2.1) holds throughout 
regime 2 and A2 = a — r2, justifying the fourth line in Table 2. Finally, 
to check that the 2's have not yet fixated when the 3's reach level 1/ ji and 
prove 

A3 = <53,3 = -fl-A2^ 

72 V 7 
we note that the size of 2/2(^3 + ^^.2) is 

1 + A272/7 + h;i = l + —{a-r2) + — -—{a-r2) 

7 72 72 

= l + 7/72 + £(a-r2) 

with 

/ / (2 + 7)2 -(3 + 37 + 72) 1 + 7 , 

£ = 72/7 - 73/72 = ^ ^— ^ ^ = — ^ € 0, 1 , 

2+7 2+7 

and hence 2/2(53 + (^3,3) £ {'f2iOt). This justifies the final line of Table 2, and 
we conclude that 

'^3' ^ A _L A _L 7(1 -73(a-?^2)/7) 

A2 + A3 = a - r2 H . 

L/7 72 

In contrast to regime 1, the relative sizes of types when the 3's reach 
l//i are not exactly the same as the relative sizes when the 2's reach level 
l//i. To describe this more complicated situation, suppose that type-(A: — 2) 
individuals have size (1/^)^ at the time type-(A: — 1) individuals reach level 
l//i. Then, if we assume: 
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Table 3 
Iteration in regime 2 



Time increment 


Type fc - 3 


Type fc - 2 


Type fe - 1 


Type k 


t]. — a ~ X 

4 = i(l-73(a-a;)/7) 


a 
a 


X 

a 
a 


1 

1 + 724/7 
/(^) 




734/7 
1 



(2a) type k — 2 reaches fixation before type k — 1; 

(2b) type k — 2 readies fixation before k's reacfi 

(2c) type k readies level before type k — 1 reaches fixation, 

we can repeat the arithmetic leading to Table 2 to yield Table 3, where here 
fix) = 1 + 72^/7 + tl = r2 + e{a - x) with £ = {1 + 7)/(2 + 7), as before. 

Since the density of 2's is /(r2) when the 3's have reached size l/n, we see 
that when type fc > 3 reaches size 1///, the density of type /c — 1 is f^~'^{r2)- 
This leads to the statement of our next result. 

Corollary 2. Suppose N = for some a S {r2,r^). Then, as 0, 

— 7 > (2 — a), and for all j > 2, — y Bj in probability, 

where ^2 = 7/72, o-'^^d if we let f^{x) = x, then for all j >3 we have 

ft = t] + = („ - f-Hr.)) + l-(3 + 3...2("-/"W) . 

J J z + 7 

Furthermore, the coefficients (3j (3^ as j —)■ 00, where 



2 + 7 

with r* = limj^oo /^'(r2) = {r2 + £a)/{l + 1). 

Proof. We need to show that conditions (2a)-(2c) above are satisfied 
for any j > and that f^{r2) converges. The latter follows from the fact 
that / has slope with (. € (0, 1), so, as j 00, 

,ji N , * r2+la 
f^{r2)^r =^-— ^, 

the unique fixed point of /. It is easy to see that £ G (0, 1) implies that 
(2.2) r2<f'{r2)<a 
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for all j > 0, and (2c) immediately follows. Since a < r-^, we have 73(0 — 
< which, along with (2.2), tells us that (2b) holds for all j > as 
well. Finally, (2a) is equivalent to 

a — 1 OL — f^~^{r2) 
72 7 

and so (2.2) implies that to prove (2a), we need only show that 

a — 1 a — r2 

72 7 

Rearranging terms, we obtain the equivalent condition a < (2 + 7)/(l + 7) 
which holds by (2.1), completing the proof. □ 

Again, the behavior of the limits yj{t) can be read from Tables 2 and 3. 
The formulas are messy, but it is easy to compute yj{t) for a fixed value of 
a. As Figure 1 shows, after a short transient phase, the increments between 
the appearance of successive types settle down into the steady-state behavior 
guaranteed by Corollary 2. Figure 4 shows the distribution of types at various 
times throughout the evolution of the system, which agree with simulations 
given in Figure 1 in the Appendix of [3]. 

2.3. Regime j, j > 3. Regime 3 occurs for a G (r-Sjr^) with r4 = + 
7/74. When 7 is small, 7/74 ~ 1/4, so this regime is roughly a E (11/6, 25/12). 
If a <2, then the initial phases are similar to regime 2, but now type 3 
reaches before the I's fixate; see Table 4. 

If we now assume that: 

(3a) type k — 3 reaches fixation before types k — 2 and k — 1; 

(3b) type k — 3 reaches fixation before type /c's reach l/fi; 

(3c) type k reaches level before types k — 2 and k — 1 reaches fixation, 

then the recursion in Table 3 becomes a pair of equations (see Table 5). 
To imitate the proof in regime 2 we would have to show that (3a)-(3c) 
hold for X = rs and y = 1 + 72/7, and for all of the iterates /^(x,?/), where 



Table 4 

I'^Si/jj sizes in regime 3, time in units of L/j 



Time 


Time increment 


Type 1 


Type 2 


Type 3 


0+ 




a-1 






Sl 


Ao = 2 - Q 


1 







S2 


Ai = 7/72 


r2 


1 





S3 


A2 = 7/73 




1 + 72/73 


1 
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Table 5 
Iteration in regime 3 



Time increment 


Type fc - 4 Type fe - 3 


Type fc - 2 


Type fc - 1 


Type fc 


t]. — a — X 


a X 
a a 

a. 


V 

fii^,y) 


1 

1 + 473/7 
f2{x,y) 


474/7 
1 



/ = (/i,/2). Figure 5 shows that this is true when a = 1.95 and 7 = 0.01; 
however, verifying this algebraically is difficult because f{x,y) may fail to 
satisfy the conditions when does. 
In general, we conjecture that if we define 

j 

i=l 

then we are in regime j if a € [rj, rj+i) and we have /3j — )• /3oo as i — > 00. In 
particular, this would imply that t* = 00 as long as 

00 

a<roo = ^i7/'yj). 

i=l 

However, as Figure 6 shows, the converse is not true. There, we have a = 
3.2, 'J = 0.1 so that a > Too ~ 3.1, but it appears that we still approach a 
constant increment between waves. 




1.85 1.9 1.95 2 2.05 



X 

Fig. 5. Successive iterates of the two-dimensional map 

(x,y) I— >■ f{x,y) = {fi{x,y), f2{x,y)) given in Table 5, started with initial conditions 
{xQ,yo) = (1 + 7/72 + 7/73,1 + 72/73) and parameters a = 1.95, 7 = 0.01. The star 
denotes the fixed point of f. 
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Fig. 6. Plot of the limiting dynamical system in Theorem 1 (fixed population size). Pa- 
rameters: 7 = 0.1, Q = 3.2. 



2.4. Blow-up in finite time. In this section we prove Lemma 1, which 
shows that for any 7 > 0, we can choose a large enough to make t* finite. 
To this end, let 7 > 0, and for all j >1 define 

00 

and let S = S{'y) = sup{S'j(7) : j > 1}. Note that since 

l-(l+7n ^f^ 1 1 

' (l+7)^-(l + 7)-^' -^^(1+7)* l-(l + 7)-^-*' 

we have 5 < 00. With this notation to hand, we can prove the lemma. 

Proof of Lemma 1. Fix 7 > 0, define S = 5(7) as above and choose 
a large enough so that a> 1 + 25" and 

7a/2/7a < 1/5*, 

where a = [a\ . We will show that Ai = (1 — (a — a))^/^a and A„ < 7/7a+n_-i 
for n > 2 so that 

00 00 
= ^ A„ < 7 ^[(1 + 7)-+- -!]-'< 00. 

n=l n=0 

To prove that A„ has the desired bound, we will show that for all n > 0, 
Ua+nit) hits level 1 before yj{t) hits level a for any 1 < j < a + n. Since this 
implies that the growth rate of type (a + n)'s is 'ja+n/j, we have, in the 
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notation of Theorem 1, /c*=a + n — 1 for all n > 1 so that Ai = di^k* = 
(1 - (a - a))-f/^a and A„ = 6n,k* = 7/7a+n-i for all n > 2. 

We first note that since yj{0) = (a — for all j > 0, we know that no 
type 1 < j < a — 1 can reach level a before type- (a — 1) individuals reach level 
1. Now, let n > and suppose that no individual of type 1 < j < a + (n — 1) 
reaches level a before type- (a + {n — 1)) individuals reach level 1. Then, for 
a-\-l <j <a + n, type-(j — 1) individuals reach level 1 at time 

j-a-l 

ij-i = (1 - (a - a))7/7a + Yl ' 

i=i ^'^+* 

at which time type-j individuals are born and start to grow at rate 7^/7. If 
there is no change in the dominant type, then type j's will reach level a at 
time j^/jj if J < a and time -|- a^/^j if j > a, so if we define tj = for 
all j < a, then the proof will be complete if we can show that 

(2.3) min(j,a)7/7j > (Wn-i - tj-i) +^/-fa+n 

for all 1 < J < a + ra. Suppose first that 1 < j < a/2. Our choice of a then 
implies that 

n 

la la la+i 

SO that 

n 

1 . . 1 



jilllj) > > ta+n-1 + 

If a/2 < j <a, we have 



i=0 ^'^+* la+n 



j>a/2>s>y^^>y^^ 

so that, again. 



7 

j{l/lj) >ta+n-l + 



la+n 

Finally, if a + l<j<a + n, we have 

00 

a>S>Y > ilj/l)ita+n-l - tj-1 + l/la+n), 

i=0 



which completes the proof of (2.3). □ 
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3. Ideas behind the proof. For the remainder of the paper, we suppose 
that p > and that (0,7) GGp. C will always denote a constant that does 
not depend on /i and whose value may change from line to line. We begin 
with a simple, but useful, lemma which explains why the limiting result for 
the birth times follows from the limiting result for the sizes. 

Lemma 2. Let e,b>0. Then, 



as /X — 7> for any j > 0. 

Proof. Let Mj{t) denote the number of mutations from j's to {j + l)'s 
by time t and let 



as /i — 7- 0, yielding the result. □ 

Assuming we have the uniform convergence of Y-^{t) — )• yi{t) for all i > 



the desired convergence of Tj'^^ follows. 

Our next result gives an approximation for the population size N^{t) 
that yields the desired uniform convergence of and also proves useful 

in other situations. 

Lemma 3. Let (,a>0. Then, as fi^O, 



Proof. Let Nj{t), <t < aL, be a family of i.i.d. pure birth (Yule) 
processes in which individuals give birth at rate p and the initial population 
is Nj{0) = 1. We then have 



P{Tj^+i < Lt,X!^{Ls) < 6(l//u)^~^ for aUs<t)^0 



A{t) = {X^{Ls) < b{l/n)^~' for aU s < t}. 



Since mutations to {j + l)'s occur at rate fiXj{t), we have 

E[Mj{Lt);A{t)] < hLtn^ 



as /i — 7- and therefore Chebyshev's inequality implies that 
P{M,j{Lt) > l,A{t))<E[Mj{Lt);A{t)]^{) 





N^{t)^ N,{t). 
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It follows, for example, from [2], page 109, equation (5), that the moments 
(t) = E{N'j{t)), i = l,2, satisfy 

and so Mj{t) = e~P*Nj[t) — 1, t > 0, is a mean-zero martingale ([2], page 
111) with 

var(Af,(t)) = -|^-l<C. 
Since the Mj are independent, 

M{t)= J2 Mj{t) 
i=i 

(which is itself a mean-zero martingale) has 

E{Mf{t)) = wai{Mj{t)) < CN^'{l^). 
Applying Chebyshev's inequality and the L^-maximal inequality yields 



P\ sup 

\0<t<aL 



1 



<P( sup \M{t)\>CN^\0) 

c 

< —: -— — )■ as u — > 0, 

which gives the desired result. □ 

There are four steps involved in proving the desired convergence of Yj{t), 
j > 0, in Theorem 2. The first step, taken in Section 4, is to prove a result 
about the initial behavior of the process. 

Proposition 1. Let k = [a\ be the largest integer < a and define 

e ^/j7/(Ai-p), j<k, 
l(l-(«-A;))7/Afc, j = k. 
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Then, for any < ti < t2 < Aq = mm{5oj : j < k}, Yj^{t) — )■ yj{t) in proba- 
bility uniformly on [ti,t2] with 

{a-j)+tXj/j, j<k, 
0, j > k. 



Proposition 1 yields the correct initial conditions (a). The proof of the 
inductive step (b) is given in Section 5 and has three main parts that together 
roughly describe how the limit changes during one iteration of (b), that is, 
on the interval Since we wish to apply the results below to Y-^{t) 

at positive times, we consider a version of our Moran model in which we 
allow for general initial conditions X''(0) satisfying the following. 

Assumptions. As 0, F^(0) ^> a > and ^^''(0) in probabil- 

ity for all j > 0. Furthermore, we suppose that the > 0, satisfy the 
conditions: 

(i) there is a unique value of m with y^ = a; 

(ii) there is a A; > such that = for all j > k, yj > for m< j < k 
and < 1; 

(iii) yO+i>yO-l for 0<i<A;. 



Define 



a 



y^j - ^)l/{^j-m- p), m<j<k, 



5f = r 



^ 1(1-2/°- ^h/ ^k-m, j = k 

and let Ag = min{5| ■.m<j<k} for e >0. For j >0 and t < Aq, define 



yj(.t) 



{y^ + tXj^m/lV, 3<k, 
0, j > k. 



To connect the next three results below back to (b), we will use Proposition 
2 to describe the limit on the intervals [sn + e, Sn+i — £'] for small e, e' > 
and use Propositions 3 and 4 to describe the limit on [sn+i — e',s„+i +e], 
depending on which of the following two possible outcomes occurs: (i) Aq = 
5^ and a new type is born, or (ii) Aq = 5^ some n G (m, k) and there is 
a change in the dominant type. 

Proposition 2. Let e > and suppose that the above assumptions hold. 
Then, Y-^it) yj{t) in probability uniformly on [0, Ag] for all j > 0. 

Proposition 3. Suppose that the above assumptions hold and that Aq = 
61- Fort<£, let 

v(A^ + t) = l ^yj(^o) + tXj-mh)'^, j<k + l, 
^ \0, j>k + l. 
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There then exists ei = Si{y^) > such that for all j + Y-^{t) — )> yj{t) 
in probability uniformly on [Ag, Aq +e] and 

P( sup Yk+i{t)-{t-A,/2)Xk+i-m/l>e/2)^0, 

^A^./2<t<A()+e ^ 

(3.1) P( sup Yk+iit) 

^Ao+(7/Afc_„Oe/2<t<Ao+£ 

- (i - Ao - (7/Afc-,n)e/2)Afc+i_„,/7 < -e/2) ^ 
as fi^ 0, provided e < ei. 

Proposition 4. Suppose that the above assumptions hold and that Aq = 
6^ for some n € (m, k). For t <e, let 

yj{Ao + t) = ( (2/i(Ao) + tXj^nh)^, j < k, 
[0, j > k. 

There then exists £2 = ^2(2/") > such that Yj^{t) — > yj{t) in probability uni- 
formly on [Ag, Aq + e], provided e < 82- 

Note that n 7^ m + 1 is possible (see Figure 6). 

Proof of Theorem 2 from Propositions 1-4. Suppose that X^(0) = 
A^'^(O) and Xj(0) = for ah j >l, and let yj{t) denote the dynamical sys- 
tems described by (a) and (b). Let i^T be a compact subset of (0,t*), ( > 
and take a € (0, Aq), n{K) > 1 so that [a, Sni^x)] ^ where s„ is as defined 
in (b). Choose s > small enough so that e < ei(y(s„)), e2(y(sn)) for all 
n < n{K), where £1,62 are as in Propositions 3 and 4, respectively. Without 
loss of generality, suppose that e < C/(c + 1) , where c = 0(7, p) > 1 is defined 
below. We also set Sn,e = Sn — £7/Aj*_^_m„_n where j* satisfies A„ = Sn,j*- 

By Proposition 1, we obtain Yj{t) — )■ yj{t) in probability uniformly on 
[a, si^s] ■ Suppose now that we have uniform convergence on [fl, '-'n,£] fo^ some 
n < n{K) — 1. We then have two cases to consider. If j* = j for some j G 
(mn, kn), then applying Proposition 4 up to time s„ + e and then Proposition 
2 with 2/° = yj{sn + e) for all j up to time Sn+i,e^ we obtain the result. If 
J* = then Proposition 4 clearly allows us to extend uniform convergence 
for Yj{t), j ^ kn + 1 up to time Sn + e. To do this for j = + 1, we first 
apply Proposition 2 to get convergence up to time s„ £/2- Write 

Yjit) - Vjit) = {Yj{t) - (t - s„,£/2)Afc„+i-mj7) 

+ ((* - Sn,£/2)Afc„+l-m„/7 - %•(*))• 
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Recalling that yj{t) = iit<Sn and yj{t) = {t- s„)Afc^+i_m„/7 if < t < 
Sn + £, we can see that 

h - y,{t) G [0, (Afe„+i_„JAfe„_„,Je/2] c [0,ce/2] 

for all Sn,e/2 ^ i ^ Sn + the last inclusion following from the fact that 

A,+i/Afc = ((1 + p)(l + 7)'^+^ - 1)/((1 + P){1 + if - 1) 

<((l + /,)(l + 7)'-l)/((l + p)(l+7)-l) 
= c 

for all A; > 1. Since Proposition 3 implies that Yj{t) — (t — s„ e/2)^fc„+i-m„ /7 < 
e/2 for all Sn,e/2 ^ ^ ^ Sn + £ with high probability and c > 1, we obtain 

P( sup Yl!^^{t) - yk+i{t) > {c+l)e/2) ^ 

as ^ ^ 0. To prove the lower bound, we note that Yk+i{t) — yk+i{t) > 
for t < Sn, yk+i{t) < ce/2 for ah t < Sn + (7/^fc„-m„)e/2 and, by a similar 
argument to the one above, using the second equation in (3.1) instead of 
the first, Yk+iit) - yk+i{t) < -(c+ l)e/2 for all s„ + (e/2)(7/Afc„-mJ < t < 
Sn + £ with high probability. Therefore, 

P( sup y,%(t)-2/fc+i(i)<-(c+l)e/2) ^0. 

Since e <C/{c + 1), we conclude that 

P( sup \Y^^^^{t)-yk+i(,t)\>c)^0 

as ^ — )> 0, so we have convergence up to time Sn + £■ Finally, to complete 
the proof of the inductive step, apply Proposition 2 with y^ = yj{sn + e) to 
extend the convergence up to time Sn+i,e- D 

4. Initial behavior. In this section we prove Proposition 1 concerning the 
initial behavior of the limit, but before we can begin, we need to take care 
of some preparatory details. We set No = {0, 1, . . .} and for x = {xo,xi, . . .) € 
M^o , we write x^'^ = x + Cj — e^, where the Cj € M^" , J > 0, are the standard 
basis vectors. It is useful to note that we can define {[N^^{t)^X^^{t))}t>o as 
the Markov process with state space 




and initial population (iV^ (0) , (0) ) = (iV^ (0) , (iV' (0) , 0, 0, . . . ) ) with ^ (0) 
distributed according to vq in which (A^, x) ^ (N, y) at rate Pj^k{x) + p6j^i^kXj-i 
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if y = x^''' for some j, k >0, {N, x) i— )■ {N + 1, y) at rate pN{l + jyxj/w if 
y = x + Cj and {N,x) i— )• {M,y) at rate otherwise, where Jj^fc here denotes 
the Kronecker delta symbol and 

Pj,k{x) = - — w = y^{i + -fyxi. 

i>0 

We let 

6°(x) = piV(l + -fyx^/w + = Y^PjA^) 

denote the birth and death rates, respectively, of type j's, ignoring muta- 
tions, and drop the O's when the mutation rates are included. = (^{^^ {s) '■ s < 
t}, and unless otherwise explicitly stated, when we say a process is a martin- 
gale, submartingale, etc., it will be with respect to the canonical filtration 
We will also use the notation 

j>0 J 

to denote a particular cross section of our state space S. 

For convenience, we will assume for the remainder of this section that 
A^^(O) = Our first lemma, which is similar in spirit to Lemma 2, takes 
care of the limits for j >k + 1. Recall that Tj^^^,-^ = min{t : X^^-^(t) > 0}. 

Lemma 4. If k = [a\, then P{T^^^ < Lt/"/) as fj. ^ for any t < 
So,k- 

Proof. Since type j's are born at rate bj{x) and die at rate dj{x), we 
have 

j^EX^{t)=E{b,{X^{t))-d,{X^m- 
Using Yl Xf{t) = Nf{t) and (1 + 7)* > 1 for i > 0, we have 

E,->nf(l + + 7P - (1 + iy]X!"(t)Xl(t) 

h,(x^{t)) - d,{x^{t)) = T.J{i + ^Yx?{t) 

(4.1) +^,{x^^,{t)-x^{t)) 

<X,X^{t) + fiX^^,{t) 
for any t > 0. Thus, for j > 1, we obtain 

j^EXf{t)<X^EX^{t) + ^^EXf_,{t) 
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so that integrating both sides yields 

EXf{t) <fi [ SXj_i(s)e^^(*-^) ds for j>l. 
Jo 

We claim that induction now implies 

(4.2) EXf^{t)<Cj{l/n)''-^e^J\ 

To prove this, we note that EX^{t) < EN''{t) = (l//i)"e''* [recall that A^''(t) 
is just a Yule process], so the result for j = holds with Cq = 1. Using the 
induction hypothesis and integrating, we have 

Jo 

< C7j_i(l/^r-^e^^* I'e-'^^^-^^-^^' ds, 
Jo 

which proves the claim with Cj = Cj_i/(Aj — Aj_i). 
From (4.2), it follows that 

r EXf{s) ds < C7(l//i)"-Je^^*. 
Jo 

In particular, taking t < 5o,fc = 7(1 — (oi — k))/Xk, we have 

(4.3) / EXj^is) ds < C(l//x)i-('^o,fc-t)Afc/7. 
Jo 

The rest of the proof is the same as the proof of Lemma 2. □ 

To obtain the appropriate limits for j < k and complete the proof of 
Proposition 1, we will couple Xj{t), j < k, with upper- and lower-bounding 
branching processes Zj^{t) and Zjg{t), respectively, so that Zj^{t) < Xj{t) < 
Zj^{t) up until some stopping time a, which will be greater than Lt/^ with 
high probability for any t < Aq, and will then show that we have 

(l/L)log+Z,-,(Lt/7)^yj(t) 

in probability uniformly on [ti,t2] for any < ti < i2 < (see Lemma 7). 
The coupling is made possible by applying the following result to bound the 
birth and death rates of type j's on the interval [0, Aq]. 

Lemma 5. Suppose that x € and that there exist m,M ^ Nq, r] > 
such that (i) ^j^Lj^Xj < fi'^N and (ii) xj = for all j > M . For all j ^ m, 
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we then have the inequalities 

(1 + P - + ^y~^x^ ^ ^0^^^ ^ (1 + p)(i + ^y- 



L__^<4(.)<.„ 

where = -y^i^rniM - m)ff and h^ = -j^rn'mfi'^. 
Proof. From the definition, 

: (1 + p)N - Xj 



b^^{x) = xj{i + ^y 

= x,-(l + 7) 



{l + p)-x,/N 
l + Ei[(l + 7)'-™-l]x^/iV' 



To get the lower bound, drop the terms in the denominator with i < m, 
which are < 0, and use the fact that j 7^ is increasing. For the upper 
bound, drop the terms with i>m. The death rates are given by 

so the upper bound is trivial. The lower bound follows in the same way as 
the lower bound for b^{x) once we write 



.1 ^ ^ J AT \ \ \ f 1 I _A7. — m. 11™ — 



We now describe the bounding processes. Let < ti < t2 < 

V = r]{t2) = ^^^{Ao-t2). 

The reason for this choice of 7] is that 

yj{t) < {a + tp/-f)-4r] 

Table 6 

Rates for the comparison branching processes, j >1 

Birth rate . ''^TlT"' - (i+p-.;M(i+.)^- 

Death rate d^^ = = 1 + /i 



Immigration rate jiZ^^-^^ ^{t) fiZ'^_-^ ^{t) 



TRAVELING WAVES OF SELECTIVE SWEEPS 



29 



for all t < t2, i > 1- For our bounding processes, we set ZQ^u{t) = N^{t), 
^o/i) = (1 - kfj.'^)N^{t) and let Zj^^, I < j < k, a = u,e, be (birth and 
death) branching processes with rates given in Table 6, taking m = 0, M = k. 
Note that the birth and death rates are per particle. The extra factor /x in 
the definition of takes care of deaths due to mutations. We also set 
X^^ = bj^ — dj^ to be the growth rates of Zj^, a = u,£, so that we have 
Xj^ ^ Aj as ^ — )■ for j > 1, a = u,£. If we use the convention that Aq ^ = /? 
for a = u,i, this also holds for j = 0. 

For the next result, we use the notation Za{t) = {Zq ^{t), Z'^ ^{t), . . . , Z^^{t), 
0, . . .), for a = u,i. 

Lemma 6. There exists a coupling of X^(t) with Za{t), a = u,£, such 
that 

Z^/t)<X^{t)<Z^Jt) 
for all t < (La/j) A T^j^^, j < k, where 

a = mf{t > : y.^(t) > a + tpf-f - 2r] 

for some i>l or \F'^{t) — (a + tp/^) \ > ij}. 

Proof. For t < T^^-^, we have Xj{t) = if j > k. Furthermore, if i < 



{l/pYeP^ 



so that 



iVM(Li/7) - (1//^)" ^ 

for all J > 1 and hence we have the bounds on birth and death rates given 
in Lemma 5 with m = and M = k. The processes can therefore be coupled 
in an elementary way by matching birth, deaths and immigrations in the 
appropriate manner. □ 

The result which we will dedicate most of the remainder of this section 
to proving is the following. 

Lemma 7. Let Q <ti <t2 < 5q. For a = u,i and j < k, we have 
{l/L)\og^Zl^{Lth)^y,{t) 
in probability uniformly on [ti,t2]- 
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Because y{t) < (a + pt) — 4ry for all t <t2, Lemma 7 implies that 

P((l/L)log+Z^(Lt/7) < {a + pt)-2r],yt<t2,a = u,i)-^l 

as /i — 7- 0. This and Lemma 3 imply that P{a > 1 as p—^0 and there- 

fore Proposition 1 follows from Lemmas 7, 6 and 4. 

To prove Lemma 7, we begin by defining another level of upper and 
lower bounds, Zj^, in which immigrations occur at deterministic rates. More 

specifically, for a = u,i, we define Zj^(t) as a branching process with the 
same initial population and birth and death rates as Zj^{t), but with im- 
migrations at rate filj^{t), where 

and 

We will use the convention that -^q a(^) = ^ ^- Note that 

(4.4) E{e-^Uzl{t)) = pj\-K.^ll^{s)ds 

for all J > 1 and a = u,i, an expression which will be used often throughout 
the remainder of this section. 

Lemma 8. For j > and a = u,i, 

is a martingale with respect to the filtration 

ga,t = <r{Z^As):0<i<j,s<t}. 

Proof. We prove the result for a = u, the proof for a = i being similar, 
and drop the subscripts u from all quantities for the remainder of the proof. 
It is easy to see that 

EiZ^{t + h)\gt) = e^"''Z^{t) + E(^fij'^\^"'^'+''-'h^{s)ds\gt^ 

and multiplying by e~^^ (*+'^) gives 

E(^e-^^('+''^Z^\t + h)-e-^"'Z^it)-pj^'^\-^"'I^{s)ds\g?j =0. 
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Since (4.4) implies that 
M^{t + h)-M^{t) 

= e-^^(*+'^)z;(t + h)- e-^^Wzj'(t) - j'^^^ e-^^^lj'(t) ds 

for j > 1 and the same equahty clearly holds for j = as well, the desired 
result, S(Mj'(t + h)- M'^{t)\gt) = 0, follows. □ 

Lemma 9. For all a = u,i, T > and fi sufficiently small, we have 
pfsup|MfJt)| > <C7^(°-J')/3[l + /,("-i+i)/3]. 

In particular, for all j <k, 
as ^ — > 

Proof. The second part of the result follows directly from the first, 
along with the definition of Mj'^{t). To obtain the first part, we suppose for 
the remainder of the proof that u = a and drop the subscript u. The proof 
for a = £ is similar. We will also assume that j > 1 and leave the (simpler) 
j = case to the reader. 

We proceed by calculating the variance of e~^rz^{t) and then using the 
maximum inequality to bound the second moment of Mj'{t) uniformly 
on [0,T]. To begin, we claim that provided we choose /i small enough so 
that > A^__-|^ for all 1 < i < j, we have 

(4.5) <7(t)(l/M)("--'') < i?(e-^'*Z;(t)) < C[(l^)("-^^) + ;ui/=^(l//i)2("-^-)/3], 

where g{t) is continuous on [0,oo) and positive on (0,oo). To see this, we 
note that EZo{t) = EN^^it) = (l/^)"e''*, so the result clearly holds for j = l 
by (4.4) and the general case follows by induction on j. Now, 

lEie-'^^Z^it)r 

= -2A';S(e-^i*Zf(i))' + e"2\'*^[6^Zf (t)(2Zf (t) + 1)] 

(4.6) 

- d'^e-^^j 'E[Z!^{t){2Z!^{t) - 1)] +/i/j'(^)e-2^^*^[2Zj'(^) + 1] 
= (6; + d';)e-^^^'EZ^{t) + Aii;(i)e-'^'* + 2/ii;(t)e-'^^*£;z;(t). 



32 R. DURRETT AND J. MAYBERRY 

Equation (4.4) implies that 

Jo Jo 

so that integrating both sides of (4.6) and applying (4.5) yields 

ft ft 
var(e-^^*Zf (t)) < (fe'; + 4) / e^^^> EZf^s) ds + / /x/f (s)e^^^^' 

(4.7) ^° 

< C[(l/^)(°-J') + /il/3(l//x)2("-j)/3]. 

By Lemma 8, Mj' is a martingale with respect to Qt and so the maximum 
inequality implies that 

E(sup{M!f{t)f'^ < AE{Mlf{T)f = AYs^Tie'^^ Z!^{T)), 

the second equality following from the definition of Afj*. Applying Cheby- 
shev's inequality and (4.7) then yields 

pfsup|Mf(t)| > (l/;Li)2("-j)/3) <C^f(°-j)/3[l + ^(«-i+l)/3], 

completing the proof. □ 

Corollary 3. For a = u,£, there exists a coupling of the process Za 
with Za such that 

z^/t) < z^it) < z^jt) < zl^it) 

for all t <a, where 

a = inf{t > : Jt) > /j^Jt) or Z^^^/t) < I^/t) for some j > 1}. 
Furthermore, P{(7 <t2) ^ as /i — )• 0. 

Proof. Arguing inductively, we can see that the immigration rates for 
type j's in Z^^Z^^Zu and Z^, respectively, satisfy 

Illicit) < fiz^^,/t) < fiZ^_,^^{t) < f^q^it) 

for t <a. Therefore, we define a coupling for the two processes by coupling 
births, deaths and immigrations. The fact that P{a <t2)^0 follows from 
Lemma 9. □ 

Define Yj^ai^) = (1/-^) 1°S^ Zj^{Lt/^). Lemma 7 follows from Corollary 3 
along with our next result. 
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Lemma 10. Let j <k, a = u,£. Then, Y^^{t) — )• yj{t) in probability uni- 
formly on [ti,t2]- 

Proof. Again, we only prove the result for u = a and drop the u sub- 
script. Let j <k and write 

= ((l/L)log+[e"^'^*/^Z;(Lt/7)] 

(4.8) 

-(l/L)log+i^[e-^'^*/^Z;(Lt/7)]) 

+ ((l/L)log+i?[e-^^'^*/^Z;(Li/7)] - (« - j)) + - A,)t/7. 
By Lemma 9, 

- EZ^^^{t)\ < e^'*(l//i)2("-^-)/3, Vt < t2) ^ 1, 
and on the set where 

- EZ^Jt)\ < e^'^(l//.)2(-"^-)/3 Vt < t2, 

we have 

(l/L)log+[e-^'^^/^Z;(Lt/7)] - (l/L)log+i^[e-^'^*/^Z;(Li/7)] 
1, Z^{Lt/^)-EZ^iLth)\ 



.log 1+ ' ' 

L \ EZ^iLtM J 

^ c \z';{Lth) - Ez';{Lth)\ ^ c (i/^)2(°-^-)/3 ^ 

- L Ezi;{Lth) - L [l/^iY-i ^ 

uniformly on [ti,i2] as /i — ?• 0, the last inequality following from (4.5) and 
the fact that g{t) is bounded away from on [ii,f2]- Therefore, the absolute 
value of the first term on the right of (4.8) goes to zero uniformly on [ii,i2]- 
It is clear from (4.5) that the second term goes to as well and since — >■ Aj 
as /X —7- 0, the result follows. □ 

5. Inductive step. In this section we prove Propositions 2-4. We shall 
assume throughout that the assumptions from Section 3 hold and begin with 
the proof of Proposition 2. The reader should refer to the statement of that 
result for the notation used throughout this section. 
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5.1. Interior convergence. Let e > 0, set aj{t) = a + tp/j for j ^ k, 
ak{t) = 1 and choose rj = rj{e) > so that: (i) yj{t) < aj{t) — 2rj for all t < A^, 
j 7^ m; (ii) yj-i{t) — yj{t) <l — 2r] for all t < A^, j > 0. Given C > 0, we define 
the stopping times 

^o(j)^7r;/L, 

aiU) = mf{t > : Yf{t) > a,(t) - ??}, 
ai = inf cri(i), 

o"i = inf cri(j), 

o-2(j) = inf{i > : Y/^iit) - Y^it) > 1 - ry, for some 1 < i < j}, 
a3(j,O^mf{t>0:y/(t)<C}. 

For the remainder of this section, set = ao{k + 1) and (72 = (T2{k). We 
shall prove convergence of Yj^{t) up to time (T(j, C) = ctq A cri A (T2 A 0-3(7, 0- 
For types j <k, this will essentially amount to controlling the infinitesimal 
variance of Yj^ (Lemma 11) and then showing that the infinitesimal mean 
converges to the appropriate limit (Lemma 12), while for types j > k we 
will simply show that they are unlikely to be born before time LA^/j, that 
is, o-Q > Ae with high probability (this follows from Lemma 2). We then 
complete the proof of Proposition 2 by using the structure of the limit yj{t) 
to extend convergence up to time A^, as required. If yj{t) is bounded away 
from 0, then this is easy since our choice of r/ implies that 0-1,0-2 are unlikely 
to occur before time A^ and if yj{t) is not bounded away from (which can 
only happen if j < m), we will define a stopping time a' such that Yj^ (t A a') 
is a supermartingale to conclude that once the j's drop below a certain level, 
they will never climb up again. 

The first step is to calculate the infinitesimal mean and variance. Writing 
yj = (1/L) log(xj), y = {yo,yi, . . .) and N = Y^e^y% noting the time rescal- 
ing and using the fact that the change in yj when Xj jumps to Xj ± 1 is 
(1/L) log(l ± ajj^), we can write the infinitesimal mean of Yj^{t) as Bj{y) = 
Bj^riy) + -Bj,m(y), where 

Y .,.(l + -yYe^y' 

L»>o(l + 7)'e^^' 

Bj^^ = fij-^e^y^-' log(l + e-^y^) + fij-^e^y^ log(l - e'^^O- 

In words, Bj^j-iy) is the rate of change due to death and subsequent replace- 
ment, while Bj^^{y) is the rate of change due to mutations. Similarly, the 
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infinitesimal variance is 



My) = {l/L) 



7 



+ /i7-^e^^^-»(log(l + e-'^J'O) V A^7"^e-^^' (log(l - e-'^J'O) 
Introducing = xlog(l + /2(a;) = xlog(l — x~^), 

and 

. . lE^^i(l + 7)^ 
*''^^^"7E.>o(l + 7)^e^^^' 

we can write 

+ /i7-Vf(e^^0e^^^-'"^^^^' + M7-V|(e^^0e"^^^]. 

Since gj, < (1 + /))(! + 7)^7, OjM^) < V7 and /2(x) < < 1 for all 
X e [0, 00), we obtain the bound 

(5.1) Ajiy) < (C/L)((l + ^)e-^% +//e^^^--2^^0- 

Define 

M,(t) = Yfit) - Yf{0) - [b,{Y^{s)) ds. 
Lemma 11. For any ^ > and j > 1, 



P[ sup \Mj{t)\>n ^0. 

Proof. Since Fj^i(i) - < 1 - 77 for t < a2, we have 

e-^^'W<l and ^e^^'-^-^^^''^ < C^^ 
and therefore the result follows from (5.1) and Corollary 2.8 in [16]. □ 

Our next step is to show that the infinitesimal means converge to the 
appropriate limit. The key to the proof is that bj(Yj^{t)) Xj-ml for all 
t < but we write out the details carefully because we will need (I)- 

(IV) from the proof several times in what follows. 
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Lemma 12. If j and (,^> 0, then as — > 0, 



P[ sup 

vt<<^{i,C) 



t 

Bj{YNs))ds-Xj.„,t/j 







Proof. Using the definition of fi, gj^i, i = l,2, we write 
BjAv) = + /2(e^^05,,2(2/) 

and 

We will complete the proof by proving the following four facts: 

(I) for any C > 0, /i(e^^/(*)) ^ 1 and ^(e-^^/'^*)) ^ -1 in probability 

uniformly on [0, cJ3(j, C)]; 
(II) gj^2(Xj^('t)) — ^ 1/7 in probability uniformly on [0,cJi(j)]; 

(III) for any C > 0, 

P(sup g,,i{Y^{t)) > (1 + p)(l + 7)i-"V7 + C) ^ 

and, furthermore, gj^i{Yj^{t)) {1 + p){l + jy~'^ /j in probability uni- 
formly on [0, (To A di]; 

(IV) Bj^fj,{Y'^{t)) in probability uniformly on [0,cr2(j)]. 

(I) follows immediately since /i(x) — > 1, /2(x) — )> — 1 as a; ^ 00 and Y^{t) > C, 
on [0, (T3(j, C)]. To prove (II), write 

1/ (1+7)^6^2'^ 



5,,2(y) = i(l 



7 V Ei>o(l + 7)'e^^^ 
and note that if ^ e^^' = iV and yi < a, then 

< < (1 + lye'^y^lN < C{l/^iT/N. 

E»>o(l + 7)*e^^' 

Now, Lemma 3 and the assumption that -F^(O) a imply that N'^{Lt/^) > 
(1 /^()Q+tp/7~r,/2 fQj. g^ii ^ < ^^^Yi high probability so that since e^^ti^) = 
Ni^{t) and Y^{t) <a + tpj^i -r],\it< ai{j), (II) follows. For (III) we note 
that if ^ e^^' = A^, then using the definition of jj = (1 + 7)-' — 1, we have 

^_ ii + ^y-"' f i + p-e^y^/N 

7 Vl + E.^.7.-™e^^V^. 
The first part of (III) then follows from the fact that 

7i-me^^ViV > ^ ji-me'-y^/N > 7_„(l/^)7iV 
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if yi ^ o for all i <m, while the second part follows from the fact that we 
also have 

/ 1 + \ ^ l + p-{l/iiY/N 

Vl + E.^m7i-me^J^VA^y " l + {k-m)-ik-m{l/pY/N 

if Uj < a for all j <k and yj = for all j > k. Finally, to prove (IV), we use 
the bound 

so that since f2{x),fi{x) < 1 for all x > 0, the result follows from the fact 
that 

if Vj-i - < 1 - □ 

Proof of Proposition 2. Lemma 2 imphes that if we have the result 
for j < k, then P(<to < A^) 0, so it suffices to prove the result for j < 
k. Lemmas 11 and 12 and the assumption that l^^(O) — )• in probability 
together imply that Yj{t) — )• yj{t) in probability uniformly on [0,(T(j, C)] for 
any > 0. Note that since yi{t) < ai{t) — 2r} for all t < A^, i ^ m, by our 
choice of rj, we have P{ai < A do A 0-2) — ^ 0. Furthermore, yk{t) < 1 — e 
for all t < Ag and hence P{cro < A^ Aa2) by Lemma 2. Therefore, we 
obtain uniform convergence on [0, A^ A (T2 A (7s{j, ()] for any C > 0. We will 
show that the convergence is uniform on [0, Ag A (72]. Proposition 2 follows 
since yj-i{t) — yj{t) <1 — 2r] for all t < A^, j > 0. 

Suppose first that j > m. There then exists C > such that yj{t) > C for 
all t < Ag and hence we obtain uniform convergence on [0, Ag A 0"2(j)]. The 
same argument applies if j < m and yj{t) is bounded away from on [0, Ag]. 
If j < m and yj{t) is not bounded away from on [0, A^], set 

Cr' = (Jo A (Ti A (T2, 

let > be small and choose a time Iq < A^ such that yj{to) = If 
no such time exists [i.e., yj{t) = for all t], set to = 0. Then, yj{s) < 
for all to < s < Ae and 

P( sup \Y,{t)-yj{t)\>c) 

<P( sup \Yj{t)-yj{t)\>c)+P( sup y,(tA(j')>C(l + C/4)) 

(5.2) 

<P( sup |F,(t)-?/j(t)|>c)+P(>S(toA(T')>«/2) 
+ P( sup y/(tA(7')>C(l+^/4)|y/(toAa')<a/2). 
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The argument in the last paragraph imphes that Yj^{t) Ujit) uniformly 
on [0, tQ A a'] and hence the first and second terms on the right-hand side of 
(5.2) are each < for all fi sufficiently small. To control the third term, 
we note that (II)-(IV) from the proof of Lemma 12, along with the bounds 
fiix) < 1, f2{x) < — 1 for all X >0, imply that if n is sufficiently small, then 
BjiY^'it)) < for all t < a' with high probability so that y^(t A a') is a 
supermartingale. Therefore, 

P(^^ max A a') > C(l + C/mfito A a') < a/2) < < C/2. 

Since C,^ were arbitrary, this proves that Yj(t) — )• yj{t) in probability uni- 
formly on [0, Ae A a'] . Since we have already shown that P{(Tq V cji < CJ2) — ^ 0, 
this completes the proof. □ 

5.2. Birth of a new type. In this section, we prove Proposition 3. Note 
that ^^(Ag) = 1 — e for small e since Aq = 5^ and choose e = £{y^) small 
enough so that the limiting dynamical system satisfies ?/j(Ao -|- t) < a -|- 
tp/j — 2?7, j ^ m, and yj„i(t) — yjit) < 1 — 2r] for all j > 0, t < e and r] 
sufficiently small. Since the result for j ^ k follows from the arguments used 
to prove Proposition 2, we only need to prove (3.1). To explain these in- 
equalities we note that our limiting process has ?/fc+i(Ao + 1) = Xk+i-mt/j 
and yfc+i(Ao — t) = for small t. However, when t is small the number of 
type- (A; + 1) individuals is small and deterministic approximations are not 
valid. The best we can do is to say that Yj^_^^{t) cannot get too far above the 
line with slope Xk+i-m/7 that starts just before time Aq [the first inequality 
in (3.1)] or too far below the line with slope Xk+i-m./l that starts just after 
time Aq (the second inequality). 

We begin by defining branching processes Zj^^-^ ^(t), a = u,£, with initial 
populations Z^_^-^^ ^(0) =0 and per particle birth and death rates given by 
Table 6 in Section 4, but with immigrations at rate e^*-™*. The methods 
used in the proof of the next result closely parallel the methods used to 
prove Lemmas 8~10 in Section 4. 

Lemma 13. Let <ti <t2. Then, 

in probability uniformly on [ti,t2] as for a = u,i. 

Proof. We prove the result for a = u and drop the subscripts u from all 
quantities. For ease of notation, we will also write Z{t) = Zj^,^{t) but leave 
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the /i superscript on A^^-,^ to distinguish it from Xk+i = (1 + /o)(l + 7)^^ — 1. 
Notice that 

_ {l+p){l + ^y-rn + 

^^+1 l + 7_„m/i^ l + 7fe+i_^(A; + l-m)/x^ ''^^ ™ 
as /i — >• 0. 

Define M(f) = e"^fe+i*2'(t) - S(e"^fe+i*Z(t)). The same argument in the 
proof of Lemma 8 then implies that M{t) is a martingale (with respect to 
the cr-algebra generated by Z{s), s <t). Furthermore, we have 

(5.3) E(e-^^+i*Z(t)) = (1 - e-(^5:+i-^^-)*)/(A^+, - X^-m) 

and a similar argument to the one used to prove (4.7) in Section 1 implies 
that 

var(e^^fe+i*Z(t)) < C. 

Prom the L2 maximum inequality and Chebyshev's inequality, we can con- 
clude that 

P( sup M(s) >Li/2\ ^0 

as /i — >■ 0. This yields a result analogous to Lemma 9 in Section 4. The 
conclusion of Lemma 13 then follows using the same argument as in the 
proof of Lemma 10 since -^^-i-i ^ -^fc+l— m ^ ^h—m 

as /i — )■ and so (5.3) 

implies that there exist ci , C2 > such that 

ci < sup£;(e"^'*='+i*Z(s)) < C2 

for all f > if /X sufficiently small. □ 

Proof of Proposition 3. Suppose that e<e\= eAl/(2(Afc+i_m/AA;-m + 
Afc+i-m/T))- Let ?7 > and define 

^1 = - yk{t)\ < e/4,l^/(t) <a + pt-7],yj< k,j y^m,t <Ao + e}, 

^2 = {T,^H.l>A3,/4L/7}, 

As = {T,^+2 > (Ao + e)L/7,y,';,(t) < 1 - r?, Vt < Ao + e}, 

A = AinA2nA3. 

Note that P{A1) — > by Propositions 2 and 3 applied to j 7^ k, while 
P(A2) — by Lemma 2. Recalling that 

yfc(A3£/4 + 1) = 1 - 3e/4 + t\k-mh, 

we have 

X,^(L(A3,/4 + t)h)e-^^'-^'/^ < l/p 



40 R. DURRETT AND J. MAYBERRY 

for all t < (Aq + e) — ^se/^ on A. Therefore, using the bounds on the birth 
and death rates given in Lemma 5, we can couple Xj^_^-^{L{A-^i,/4 + t)/j) with 
■^fc+i u^^^hl)^ a = n, in a similar manner to Lemma 6 so that on A, we have 

^fe+l(^(A3./4 + t)h) < Z^,^,^^{Lth) 

for all t < Aq + e — Lemma 13 then yields the first limit in (3.1), 

provided we are on A. However, we then have 

< 1 - r/, Vt < Ao + e} n yl) ^ 1, 

and it follows from this that P{A^) — t- for small r], which proves that the 
first limit in (3.1) holds. To prove the second limit, we use the fact that 

yk{Ao + (e/4)(7/AA,.-„) + 1) = 1 + e/4 + tXk-m/l 

to conclude that 

Xj^{L{Ao + (e/4)(7/Afc„„,) + t)/7)e"^'=-'"^*/^ > l/fi 

for all t < e(l — 7/(4Afc-m)) on A. Hence, we can couple X^_^-|^(L(Ao + 
(e/4)(7/Afc_„0 + t)/7) with Zji^^/Lt/j) so that 

X^_^,(L(Ao + (e/4)(7/Afc_^) + i)/7) > Zji^i/Lt/j) 

for all t < e{l — j / {4:Xk~m)) on A and the second part of (3.1) again follows 
from Lemma 13. □ 

5.3. Change in dominant type. In this section, we prove Proposition 4. 
We begin with some notation. Let 



(yO + A,_^V7)+, iij<k, 
0, if j > k. 



Note that yj{t) < yj{t) for all t with equality if i < Aq. As in Section 4.1, 
let aj{t) = a + pt if j ^ k, = 1 and choose e = e(y°) and 77 > so that: 
(i) yj{t) < aj{t) - 2r] for all t< Aq + e, j ^ m,n; (ii) yj-i{t) - yj{t) < 1 - 2r/ 
for ah t < Ao + e; (in) yj{t) >a + pt — ?//4 for all Ag <t < Aq + e, j = m, n. 
Without loss of generality, suppose that i] < {a + A^p/^)/4:. Let (7i{j), i = 
0,1,2,3, be as in Section 4.1 and set 

aij) = ao{k + 1) A ( min ai{i)) /\ Mj) /\ (T3{j) . 

Our first lemma sets the stage for the battle between types m and n by 
showing that all other types remain smaller than these two. 

Lemma 14. // j ^m,n and ^ > 0, then 

P( sup {Ynt)-yj{t))>A^O. 



Proof. 
12. □ 
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This follows directly from Lemma 11 and (I)-(IV) from Lemma 



Note that Lemma 14, Lemma 3 and our choice of e imply that 

(5.4) .up ^^^<,^ 

for all j ^ n^m with high probability. Furthermore, X/^^Lt/^) < (l//i)^~^ 
for all t < Aq + £ with high probability, and hence Lemma 2 implies that as 

(5.5) P(r,^V,<(Ao + e-)L/7)^0. 
Let 



x;(LA,/7 + t) 



be the fraction of j's in the population at times greater than LlS.^/^. Then, 
as a consequence of (5.4) and (5.5), we have 

(5.6) < 1 - {Rm{Lt/j) + Rn{Lt/j)) < {k + l)^'? 

for all t < Aq + £ — A^ on a set A with P{A^) 0. Our next result concerns 
the change of power from m's to n's. To state the result, let 

/ [r) — r(l — r 



1 + -in-mT 

and define fji^), j = n,m, as the (random) solutions to the initial value 
problem 

^ = -/(I -r^^) = /„«), 

(5.7) 

with initial conditions r^(0) = Rj{0), j = n,m. 

Lemma 15. There exists an £2 = £2{y^) > such that for j = n,m, 

P( sup \R^^{Lth)-r^;{Lth)\>^i^l^ 
V<Ao-A,+£ 

as /.i — 7> for all e < £2- 
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Proof. We will prove the result by calculating the infinitesimal mean 
and variance of Rj{t). Without loss of generality, we assume that e < e so 
that by (5.4)-(5.6), we have 

^ R^{t)<ik + l)f,^, 

(5.8) 

0<l-(i?(^(t)+i2^(i))<(A; + lK 

for ah t < L(Ao + e - on a set A with as /i 0. Note 

also that Lemma 3, the fact that A^'^ is nondecreasing and our choice of 
?7 < (a + Ai;p/^)/4 together imply that 

(5.9) N^{t)>C{l/pr+'^^p/^~">C{l/pf'^ Vt>LAj7 

on a set A with P{A^) as p 0. We will therefore assume that the 
inequalities in (5.8) and (5.9) hold for the remainder of the proof and write 
0(/u^) for any quantity whose absolute value is bounded above by C/i^ 
uniformly for t < L(Ao + e — A^)/^ on a set A with P{A'^) ^ as 0. It 
all also convenient to write 

= Ao + e - As = (1 + 7(A„„^ - p)-^)e. 

By looking at the rates for the chain {N'^{t), Xf^it)), the fraction Rj{t) has 
the following jump rates corresponding to the events Xj/N {xj + l)/N, 
Xj/N ^ {xj - 1)/N, Xj/N ^ {xj + l)/(iV + 1) and Xj/N ^ Xj/{N + 1), 
respectively: 

(I _|_ -yV'r, 

rj ^ rj + 1/N rate: A^(l - rj)- '-^-^ + pNrj_i; 



w 



w — {1 + 7)''Vj 



rj I— )• r j — rate: Nrj + pNrj; 



w 



rj ^ rj + (1 - rj)/{N + 1) rate: pN^ ^f^^ 

W — ( 1 ~h '^r f^i 

rj ^ rj - rJiN + 1) rate: pN ^ 

w 

where w = X^j>o(l +7)*'"i- From these expressions for the rates, we can see 
that the infinitesimal mean of R^ is given for r G /N by 

^ (l+7yr,d-r,) ^ _ (^..(--(l+TFr,) ^ 

■w \ w 

pN (l+7)^rj(l-rj) piV rj(w - (1 + 7)Jrj) 
N + 1 w N + 1 w 
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Similarly, the infinitesimal variance is given by 

^ l/(l+7yr,,(l- r,) ^ ^ ^,(.»-(1 + tV^,,) ^ 

piV / (l + 7yrj(l-rj)2 r|(w; - (1 + Tpr^) 



_pr^(l-r^ 

(AT + 1)2 + 

where, in the second line, we have added and subtracted prjN/{N + 1)^ 

from the first. Note that (5.9) and the fact that rj S [0, 1] together imply 
that 

(5.10) A;(ii^(s)) = 0(/i3'7) 

for all s < Lce/j. 

Now, (5.8) implies that 

wiR^^is)) = (1 + jrR^is) + (1 + jTR^is) + 0(/x^) 

= (l + 7r[l + 7n-m<(.)]+0(/i'') 

for all s < Lc^/j and hence 

Bn{R''{s)) 



1 + 



iV^'(LA£/7 + s) + 1 



iVA'(LAe/7 + s) + ly V l + ln-mRin[s) J 

= UR^^{s))+0{p^) 

for all s < Lce/^-, the last equality following from (5.9) and the definition of 
fn- Similarly, writing 

w{R^^) = (1 + ^TRif, + (1 + 7)"i?^^ + 0{p^) 

= (1 + 7r[l + 7n-n^(l - ^^J] + O(^i^), 

we obtain 

B^{R^^{s)) = fm{R^^{s)) + 0{pP) 
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for all s < Lcs/'j. Combining this with (5.10), the fact that \f'{rj) \ < 7(1 + 7) 
for all rj € [0, 1] and the proof of Theorem 2.11 in [16], we obtain the result. 
(Theorem 2.11 in [16] applies directly if we replace r^(t) with rj{t), the 
solution to (5.7) with initial conditions rj(0) = lim^_!.o -R^ (0), but it is easy 
to see that the same proof applies if we use rj{t) since rj{t) is the solution 
to (5.7) with random initial conditions Rj{0).) □ 

The next step is to analyze the differential equations for j = m,n in 
Lemma 15. We will carry out the analysis for j = n [for j = m, apply the 
analysis below to 1 — rm(s)]. To begin, write 

with 

, X A„„m(l - r) 

9n{r) = — . 

1 + 7„_mr 

Note that we have the following set of bounds on the growth rate gn '■ 
(1 - 2 ^ 9nK) < Xn-m when r^, < L-2; 

1 + 7n-m 

(5.12) 

when < < 1 - L'^; 
0<9nK)<L~^Xn^m when < > 1 - L^^^ 

Lemma 16. Let s^* = inf{s :r^(s) > L'^} and ^ = inf{s:r^^(s) > 1 - 
We then have sf/{L/j) ^ Aq for i = 1,2 and (s^ - s^)/L ^ as 

Proof. Let 

X^(LAJ7) , X^jLA,/^) 
'^'^'>~ m{LA,hf ^"""^ '^"^'^"iVM(LA,/7)^ 

where = (1 — L~^)(l + 7„_m-^~^)~^- It is then clear from the first bound 
in (5.12) that 

r,^(5)<<(.)<r(:(.) 

for ah s < s'^. Since Yn{As) Vni^e), ^''(A.) -^a + A^p/j by Proposition 
2, letting and Su be the times that and hit I/"^, we have Sa/{L/j) 
Aq as fi ^ for a = i,u, which proves the result for i = 1. To prove the 
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result for i = 2 we use the bounds in the second hne of (5.12) along with the 
fact that the logistic dx / dt = f3x{l — x) rises from to 1 — in time 
(4//3)logL, to conclude that 

So — s^* C log L 
Ij Ij 

as /i — 7- 0, which completes the proof. □ 
Lemma 17. 

(1/L) \og+[N^^{Lt/^)rfiLth)] ^ y,(A, + t) 
uniformly on [0, T] for any T > 0, j =n, m. 

Proof. We prove the result for j = n. Write 

(1/L) log+[iV'^(Lt/7)r;(Lt/7)] - y,(A, + t) 

rLth 



{9nK{s))-ln{s))ds 



where in{s) = ^n-m'^s<{Ao-Ae) ■ The first term in brackets converges to 
in probability by Proposition 2. To control the second term, split up the 
integral as 



+ I + 



Jo Js'^At Jsl^At 

Using the bounds in (5.12) and applying Lemma 16, we conclude that each 
of these integrals is o(L), which yields the result. □ 

Proof of Proposition 4. Let e < £2 and suppose first that j = ?n,n. 
Writing 

Yf{t)-y,{t) = {l/L)log+R^{Lt/j) - {1/ L)log+ rf{Lt/j) 
+ {l/L)log+[N^^{Lt/j)r^{LtM]-yn{t), 

we can see that since 

J - iVM(LA,/7) - ^ 

for all t > with high probability as /i — )• by our choice of e, the result 
follows from Lemmas 15 and 17. Suppose now that j 7^ m,n. If j > k, the 
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result follows from (5.5), so it remains to prove the result for j < k. In view 
of Lemma 11, it suffices to show that 



P 



sup 

t<{Ao+e)Aa(j) 



B,iYf{s))ds-y,it) 







and then follow the argument from the proof of Proposition 2 to yield the 
result. However, now that we have proven that Proposition 4 holds for j = 
m,n, we can argue as in the proof of Lemma 12 to conclude that 



f (i + pKi + t)^ 



0<t<Ao, 
Ao<t<(Ao + e), 



uniformly on compact subsets of [0, (Aq + e) A a^j)] — {Aq}. This replaces 
the second part of (IV) from the proof of Lemma 12, and the result follows 
after using (I)-(III) from the proof of Lemma 12. □ 
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